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Abstract 

Dimension reduction is a fundamental problem in the study of dy- 
namical systems with many degrees of freedom. Extensive efforts have 
been made but with limited success to generalize the Zwanzig-Mori pro- 
jection formalism, originally developed for Hamiltonian systems close to 
thermodynamic equilibrium, to general non-Hamiltonian and far-from- 
equilibrium systems. One difficulty lies in defining an invariant measure. 
Based on a recent discovery that a system defined by stochastic differential 
equations can be mapped to a Hamiltonian system, we developed a pro- 
jection formalism for general dynamical systems. In the obtained general- 
ized Langevin equations, the memory kernel and the random noise terms 
are connected by generalized fluctuation-dissipation relations. Lacking of 
these relations restricts previous application of the generalized Langevin 
formalism in general. Our numerical test on a chemical network with end- 
product inhibtion demonstrates the validity of the formalism. We suggest 
that the formalism can find usage in various branches of science. Specif- 
ically, we discuss potential applications in studying biological networks, 
and its implications in network properties such as robustness, parameter 
transferability. 

Send correspondence to: jxing@vt.edu 



It is common to study dynamics of a system with many degrees of freedom in 
almost every scientific field. In general it is impractical, and often unnecessary, 
to track all the dynamical information of the whole system. A common practice 
is projecting the dynamics of the whole system into that of a smaller subsys- 
tem through information contraction. The procedure leads to the celebrated 
Langevin and generalized Langevin dynamics. The Zwanzig-Mori formalism is 
a formal procedure of projection, especially for Hamiltonian systems [TJ [2j [3l [4] . 
Inspired by its great success in irreversible statistical mechanics, Chorin and 
coworkers, have suggested a version of the Zwanzig-Mori formalism for higher- 
order optimal prediction methods for general dynamical systems [5J [B], The 
difficulty lies in choosing an invariant measure in defining an inner product (see 
below for details) . The choice is straightforward for a Hamiltonian system, but 
not clear for a general system. 

Recently one of us has proved that one can map a system described by a set 
of stochastic differential equations 

M 

±i = dxi/dt = G;(x) + ^2 9ij{x)(j(t), i = 1, • • • ,N. (1) 

to a Hamiltonian system [7]. In general M and N may be different, &(t) are 
temporally uncorrelated, statistically independent Gaussian white noise with 
the averages satisfying < Ci(t)Cj( T ) > = Sij8{t~ t), g(x) is related to the N x N 
diffusion matrix gg T = 2D//3, where the superscript T refers to transpose. For 
a statistical mechanical system ft is the inverse temperature, 1/fcgT with ks the 
Boltzmann's constant. For a non-physical systems, ft is a parameter analogous 
to an effective inverse temperature. Eqn. [T]is widely used to describe dynamics 
in various fields of science from physics, ecology and cell biology, finance, geology, 
etc [51 110) . The mapping makes explicit the choice of an invariant measure, 
and thus derivation of the Zwanzig-Mori projection formulae straightforward. 
The mapping is only used as an auxiliary tool to derive the formula. One needs 
not to actually construct the mapping Hamiltonian. 

In the remaining parts of the paper, we will first develop the theory, then 
present analytical results for a quasi-linear system and numerical tests on a 
small chemical network, and finally conclude with a general discussion. 

1 Theory 

1.1 Summary of the Zwanzig-Mori formalism 

We follow the notation of Zwanzig here j3J. Consider a dynamic variable de- 
scribed by a Liouville equation in a N dimensional space, 

JU(x(t),t)=L>l(x(t),t) (2) 
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For a Hamiltonian system, the Liouville operator L is defined as, 



la-^tI— — - — — ^ 

V dpi dqi dqt dpi J 

where H is the Hamiltonian, and qi and pi are the coordinates and conjugate 
momenta. Any dynamical quantity A(q, p) is a vector in the Hilbert space. One 
can define the projection of another dynamical quantity B on A as , 

PB = ^2(B,A i )(A,A)^ 1 A j ,i,j = l,-" ,m (4) 



where m is the number of components of the vector A. The inner product for 
two arbitrary variables A and B is defined as, 

(A, B) = < A*B >= J A^Bip(x)dx (5) 

where f means taking transpose and complex conjugate, and ip is a weighting 
function need to be defined. Any dynamic variable within the subspace can be 
expressed as a linear combination of the basis functions. The projected equa- 
tions of an arbitrary dynamic variable A, which is defined within the projected 
subspace, are in the form of generalized Langevin equations (GLEs), 

^-A(t) = PLA(t) - [ dsK(s) ■ A(x(t - a)) + F(t) (6) 
®b Jo 

where 

F(t) = exp(t(l-P)L)(l-P)Li4 (7) 
K(f) = -(LF(t),A)-(A,A)- 1 (8) 

At time 0, A is within the subspace. Time evolution of A is splitted into the 
dynamics within the subspace and within the orthogonal subspace, which are 
treated explicitly and implicitly, respectively. Effects of the latter on the former 
are accounted for by the last two terms in the right hand side of Eqn. [5] If 
the Liouville operator is anti-Hermitian, one further obtains the generalized 
fluctuation-dissipation relation (GFDR) between the memory kernel and the 
random force term, 

K(t) = (F(t), LA) ■ (A, A)" 1 = (F(t),F(0)) ■ (A, A)- 1 (9) 

Eqn. [5] is mathematically equivalent to Eqn. [3J and assumes a form formally 
analogous to the phenomenological generalized Langevin equation. In principle, 
one can apply the projection formalism to general dynamical systems, and the 
choice of the Hilbert subspace expanded by A and the weighting functions can be 
arbitrary [HE3. However, as Zwanzig pointed out [3], in general the procedure 
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is only a mathematical exercise with no practical usage. To be practically useful, 
one has to choose ip properly. With an invariant measure, 



/(x(t)Mx(0))dx(0) = / /(x(t)Mx(t))dx(f) 

/(x)^(x)dx, (10) 



A and x can be viewed as random variables [3J HJ [5] . For a Hamiltonian system 
near equilibrium, one can choose the Boltzmann factor p(q, p) = exp(— (3H ) / J exp(— (3H )dqdp 
as ip. The conservation of H and the Liouville theorem ensures the choice is 
an invariant measure. The term F(i) has ensemble average < F(t) >= 0, and 
thus indeed behaves as a random force term. The memory kernel and the ran- 
dom force terms are not independent, but are constrained by the GFDR. For 
a general non-Hamiltonian system it is not clear how to choose the measure. 
Phenomenological generalized Langevin equations have been used to model dy- 
namical systems such as financial market fluctuations [16J . However, lacking of 
the GFDR makes model construction very difficult [20] . 

1.2 Projection formalism for non-Hamiltonian systems 

In a seminal paper, Ao showed that one can always construct a symmetric 
matrix S and an anti-symmetric one T, and transform Eqn. Q]mto [TT] . 



M 



rfx 
~dt 



= M-(G(x)+g(x)C(t)) 
= -Vx0(x)+g'(x)C(t) 



(11) 



where, M = S + T, <f> is a scalar function corresponding to the potential function 
in a Hamiltonian system satisfying (d x d<f))ij = (didj 
2S//3. Then S and T arc uniquely determined by 



■9A) 



0, and g'g' T 



8 x [M • G(x)] = 0, (M) + (M)- 1 = 2gg J , 



(12) 



and proper choice of the boundary conditions. One may extend the above 
equation with an auxiliary momentum term, 



-T(x) 



-S(x) 



;'(x)C(i) 



(13) 
(14) 



which reduces to Eqn. [T2]in the limit m —> (p —> 0). 

In [7j, Xing showed that one can map the dynamics described by Eqn. [TTIto 
a Hamiltonian system in the zero mass limit. The proof proceeds in two steps. 
First one can define a Lagrangian so the resultant Euler-Lagrange equation gives 
Eqn. [13] and [14] excluding the dissipative terms (the terms inside the second 
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bracket in Eqn. Q3]). Second following a procedure similar to that adopted 
by Zwanzig |12j . one can replace the dissipative terms by a bath Hamiltonian 
with a large number of harmonic oscillators coupled to the primary degrees of 
freedom x. The bath is initially in contact with a heat reservoir. That is, the 
initial conditions of the bath degrees of freedom are drawn from a canonical 
distribution. The overall Hamiltonian is 



H = 



(P-A(x)) s 
2m 



(x) 



E 

a=l 



N 

E 



1 2 

Paj 



• a«(x)/(VJV<)) 



(15) 



where A is a vector potential satisfying = — -^f- , P = mx+ A = p+ A 
is the conjugate momentum. The last term in Eqn. [15] is the bath Hamiltonian 
, and its form is determined by S [TJ. The Hamiltonian describes a massless 
particle, coupled to a set of harmonic oscillators, moving in a hypothetical n- 
dimensional conservative scalar potential and magnetic (the vector potential) 
field. The mapping permits applying techniques and results for Hamiltonian 
systems to dissipative system [7j 1 1 3j . For the current purpose, the property of 
Hamiltonian dynamics suggests the inner product definition, 



(A,B) 



J AtBexp(-/3iJ)dxdp 
J exp(— f3H)dxdp 



(16) 



Alternatively, one can also replace the integration over p by p. Both defini- 
tions ensure the requirement of invariant measure [14j . As will be clear from 
the following theoretical developments and examples, in real applications one 
needs not to actually perform the mapping. The mapping merely serves as an 
auxilliary tool to derive the projection formulae and the GFDR. 

In this work for simplicity we only consider projecting to a subspace com- 
posed by the first m components of x and the corresponding velocity compo- 
nents. Generalization to collective coordinates is straightforward, and is given 
in Appendix A (see also [2] [17j [18] ) . In the following discussion we denote them 
as X = {xi, X2, ■ • ■ , x m }, and X = {xi, £2, ■ • ■ , x m }. Let's define a nonlinear 
projection operator [2| [5] [18] . 



Ph = 



1 



hp(x, p)5(X - X)5(X - (p - A)/rn)dxdp 



p(X,X) 

where h is an arbitrary function, and 

p(X, X) = / p(x, p)<5(X - X)<5(X - (p - A)/m)dxdp 



(17) 



(18) 
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Then (see Appendix A for detailed derivation), 



PLX 



X, 



PLXi = - 



_l d_ 

m/3 dX 3 

E** 



lnp(X) 



< 



dAk 
dX. 



>x - < 



dX k 



>x 



where, 



P(X) 
<5>x 



P(X) 



rfxexp(-/3^)(5(X-X) 



dxB(x) exp(-/3^)J(X - X) 



(19) 
(20) 

(21) 
(22) 



and we have omitted the bar on the variables. The projected equation of motion 
is, 



d 

ax 



dsmK ji{s)Xi{t — s) 



mFj(t) 



(23) 



where W(X) = — lnp(X)//3 is the potential of mean force, and Ty =< — 
> is the renormalized antisymmetric matrix in the reduced space. The 
memory kernel and the random force are related by the generalized fluctuation- 
dissipation relation, 



< F t (t)F 3 (t') >= Kij(t - t')/{mp) 



(24) 



Taking the ansatz mKji(t) — ^{2^ i 8{t) + Jji(t)) (see discussion below and 
the analytical examples in Supporting text for reasons), and take the zero- mass 
limit, one has, 



d 
0X~ 



y2 / ds^ji{t - s)x(s) 



■mPFj(t) 



(25) 



Where S%j = 7y. Eqn. [55] is in the form of the generalized Langevin equation 
(GLE), and together with the GFDR Eqn. [M]is the main result of this work. 
Physically the terms in the first brackets are related to the direct interactions 
(or fluxes) within the projected subspace and between the subspace and the 
surroundings. That is why we need to introduce the singular part of Kij in 
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general. The terms in the second brackets refer to the retarded interactions 
mediated by the implicit surroundings. Notice that we never actually perform 
the complicated nonlinear transformation prescribed by Ao and by Xing [7] [TT] . 
In the case we project to a 1-D system, the equation is, 

= -— — W(X) - / dsT(s)X(t - s) + m/3F(t) (26) 

with r(i) = 27o<5(t)+7i(*) 



2 Analytical and numerical examples 

Let's first consider an (JV+l)-dimensional quasi-linear system in the transformed 
representation, 

= -((>(x ) - MqXq - * 0x • x- M 0x • x + £ (i) 
= -*-x-M-x-* x0 x -M x0 i +£W 
or in the original representation, 

— ( x ° \ = ( M ° M ° x ^ ( -^o) -*0x-x + £ (i) 
dt\ x J \M xQ M J \ -*-x-#xoa:o+f(t) 

or in the transformed representation, We will project out the degrees of freedom 
x = (xi, • • • ,Xn), and retain only the degree of freedom xo- In the supporting 
text for a 2-D system we derived analytical expressions of the GLEs with the 
above two sets of equations, and showed that for the latter but not for the 
former the memory kernel and the random noise term are indeed connected by 
the GFDR. We also derived the GLE for the general (N+l)-dimensional system. 
This model can be regarded as generalization of the well studied system-bath 
model in Hamiltonian systems [4]. 

In most applications of the projection formalism including previous work on 
Hamiltonian systems, it is impractical to perform the projection analytically. 
Extensive related methodology studies exist for Hamiltonian systems, which 
can be generalized to non-Hamiltonian systems due to the mapping. We will 
leave these for future studies. Here for illustrative purpose we will demonstrate 
the validity of the projection only through a simple chemical network. The 
network is an end-product inhibition motif commonly found in metabolic and 
other biological regulatory networks (see Fig. la) [19] . Each reaction is governed 
by irreversible Michaelis-Menten kinetics, 

1 K m + x 4 K m + x\ + g ^ ^ ^ ^ 

and similar expressions for other specie concentrations. We used v m — l,K m = 
0.5, g = 0.005 in our simulations. Numerical details are given in Appendix B. 

The system is initially at the steady-state. At time 0, the concentration of 
x\ is set to a value xi(0). The relaxation dynamics xi(t) is monitored. We used 



G 




Figure 1: Determination of the GLE parameters, (a) The network with end- 
product inhibition, (b) Potential of mean force calculated from the steady-state 
distribution of the full model, (c) The memory kernel used for fitting, (d) The 
fitted and simulated relaxation curve of x\ with xi(0) = 2 < x\ > ss . We didn't 
fully optimize the fitting. 
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Figure 2: Comparison of the predicted and simulated relaxation functions with 
x\(0)/ < x\ > ss = 1.2(a), 0.8(6), 0.5(c). The model parameters are the same as 
used in Fig.l. 
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the result of one simulation on the full model, that with xi(0) — 2<x\ > ss , as 
the known information to fit the parameters for the GLE Eqn. [26] The term 
< x% > ss refers to the steady-state average. Fig lb-d show the fitting results 
and parameters. The potential of mean force is calculated from the steady-state 
distribution. We modeled the nonsingular part of the memory kernel, 71 with 
a Gaussian basis set. Physically one may understand the fitted memory kernel 
as follows: the change of x\ is propagated to X4 through a series of reactions, 
and acts back on x\ at a later time. The first portion of 71 with negative 
values (as modeled by two Gaussian functions) accounts for most of the effect. 
Some remnant effect propagates one more cycle to act on 11 with an opposite 
sign (inhibition of inhibition), doubled delay time, and reduced amplitude. We 
neglected further higher order effects. In this work we focused on illustrating 
validity of the method, and thus made no effort to fully optimize the fitting. 
We then used the set of parameters to simulate the GLE with different values 
of £1(0), and compared with simulation results of the full model. Fig Id shows 
remarkable agreement even without fully optimizing the parameters, supporting 
the validity of Eqn. [2H 

3 Discussions and concluding remarks 

In this work we developed a generalized Zwanzig-Mori projection formalism for 
dissipative non-Hamiltonian systems. Because of the mapping between a dis- 
sipative non-Hamiltonian system and a conservative Hamiltonian systerm, we 
expect that the large number of existing methods on applying the projection 
method to Hamiltonian systems can be readily applied to non-Hamiltonian sys- 
tems [101 H5 ■ We suggest that an important direction for future researches is to 
develop a number of standard ansatz (function forms for the potential of mean 
force, the memory kernel, etc) for different situations. Analytical esults for the 
quasi-linear systems may serve as the starting points. 

In most applications of the projection formalism including previous work on 
Hamiltonian systems, it is impractical to perform the projection procedure ana- 
lytically. For heuristic purpose we tested the formalism through projecting sys- 
tems with two and three dimensions to one dimension. The formalism, however, 
can be applied to arbitrary high dimensional systems. In the supporting text, we 
derived the analytical formula for projecting an (A^ + l)-dimcnsional quasi-linear 
system to one dimension. In real applications, information about a system is 
usually incomplete. Instead one can obtain the potential of mean force, the ma- 
trices S + T and 7 from available data, following the well-established procedures 
developed for Hamiltonian systems jT7] [20] [21]. On multiplying (S + T) _1 on 
both sides of Eqn. [531 one transforms back to the original representation with 
direct physical meaning for each term. 

Given the broad range of problems Eqn. [TJ describes, we expect that the 
method discussed in this work will find applications in many fields of science. 
Here we will discuss its usage and implications in the field of mathematical 
modeling of biological networks, or systems biology in a broader sense. 
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On modeling a complex dynamic system, a common problem is that there 
is insufficient information to identify a large number of parameter values in the 
model. For example in the field of systems biology, one frequent criticism of 
mathematical modeling is that one sometimes attempts to fit several data point 
with dozens or even hundreds of parameters. Fortunately analysis shows that 
for many systems the quality of the data fitting is usually largely affected by a 
small number of composite parameters, and insensitive to others [22] . A math- 
ematical model with many variables and parameters is also computationally 
expensive. The present projection formalism provides a systematic method to 
construct a reduced model with a small number of variables and parameters im- 
portant for the dynamics under study. It also provides a method for performing 
multi-scale modeling, using information obtained from finer level simulations 
for constructing a more coarse-grained model. 

Network robustness is a related problem. It has been suggested that robust- 
ness is a general property for many biological networks. As illustrated by Barkai 
and Leibler using the bacterial chemotaxis network model, a system is robust if 
its function is determined by one or a small number of composite quantities, and 
values of the latter are insensitive to variation of most control parameter [23] . 
The projection method provides a natural framework for quantifying network 
robustness under perturbations 

There are extensive discussions on whether one can use the parameters mea- 
sured in vitro on modeling processes in vivo [24) . For the latter, there are 
inevitably interactions between the subsystem one examines and the remaining 
part of the living system, which are not present in the system in vitro. The 
projection method provides a theoretical explanation why in general the two 
sets of parameters should be different. Even in the case that the memory kernel 
can be approximated by a delta function so the retarded memory term 7y = 0, 
the interactions between the subsystem one models and the remaining degrees 
of freedom affect the dynamics of the subspace through renormalizing the model 
parameters [55]. The projection method can suggest a controlled approximation 
linking the two sets of parameters. 

[A] Here we derive a more general projection formula, which reduces to Eqn. 
[2H1 The procedure resembles that of Lange and Grubmiiller [T7]. Let's suppose 
that we project to a manifold, 

c = f (x), c = Vf (x) • x = Vf (x) . (p - A)— (28) 

TO 



The Liouville operator is 

N 

v ^ / un u un u 

— ' \ fin: f^Ti rlnf: rl<n: 



A 'dH d dH d 



t=l 

N N, 



EE 



dpi dxi dxi dpi 

f dH d dH d 



(29) 



Notice that Cj and dj have no explicit dependence on the bath variables. We 
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define the projection operator as, 

Ph = 1 [ hp{x, p)5(c - f)5(c - Vf (x) • ±)dxdp 
where h is an arbitrary function, and 

p(c, c) = J p(x, p)5(c — f)<5(c — Vf (x) • x)dxc?p 
The projection of Cj is simple since it is still within the subspacc, 



N 



LC 3 = J2 



dH dcj dH dcj 
^ V dpi dxi dxi dpi 

The projection of dj is given by, 

N 



dH df. 
dpi dxi 



3 _ 



= Ci 



PLc 3 = ^-ry J dxdp5(c - f)5(c - Vf (x) • x) 

exp(-/3H) 



dH ddj dH ddj 



N 



dpi dxi dxi dpi 

dxdp5{c - f)S(c - Vf (x) • x) 



ujJi kj J-* i u a, i 



N 



= E 



i 



{ Pp{c, c) 



d_ 

dpi 



S(c - fWc - Vf(x) • (p - A)-) 

m 



dxdp 



N 1 



exp(-^)|i 



d 

dx. 



S(c - fWc - Vf(x) • (p - A) — ) 

m 



dxdp 
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N 



m oxi oc k 

N 1 

^«c,c) J 

dxi dck 



5(c- Vf(x)-(p-A)-) 

m 



1 8f 

dxdpcxp(-l3H) — -^- 
m oxi 



\ - dfk d 



S(c - f)6(c - Vf(x) • (p - A) — ) 

m 



dxi \m 



{E^^V/ fe -( P -A) 

d 
dc k 



S(c - fWc - Vf(x) • (p - A)-) 

m 



6(c - f)<5(c - Vf(x) • (p - A) — ) 



m 



E 



]_df±dfk 
- 1 m dxi dxi 



c,c)^dc k J 



6(c - t)S(c - Vf (x) • (p - A) — ; 



m 



m ^ 



df, (1 



dxi dxi [rn^^ k ^ ^ 



df k d (I 



-yii • (p - a) 



(33) 



dxi dxi \m 

To derive the above expression, we performed integration by parts, and used 
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the relations, 



V x S(c-/) = V x fd f 6(x-f) = V*fd c 6(x-f) 
V x <5(c-V x /-(p-A)/m) = V x (V x /-(p-A)/m) 

d 6 6(c - V x / • (p - A)/m) 

V P <J(c - Vx/ • (p - A)/m) = -Vx/ 

m 

ac<y(c-V x /-(p-A)/m) (34) 

We have neglected possible surface terms while performing integration by parts. 
For example, If Xi represent concentrations, one expects that p(0, p) w so 
mathematically one can extend the integration to x — > — oo. Otherwise Eqn. 
[1] is not a good representation of the system dynamics in the first place. With 
fj = Xj = Xj the final result of Eqn [33J reduces to Eqn. 

[B] For the full model, the Langevin equations were propagated by 



Xi{t N ) = Xi(t N -!) + Atd (x) + y/2gAt/j3Ci(t), (35) 

where Q(t) is generated from a Gaussian distribution with zero mean and unit 
variance, and we have set j3 = 1 throughout the work. We used At = 0.005 
in all calculations. The potential of mean force was obtained from the steady- 
state distribution histogram. All the relaxation curves are average over 40000 
trajectories. 

To solve Eqn. l26lnumericallv. we first integrate both sides from ti — (i— 1) At 
to U+i = iAt, 

= -/* dt'^-W(X(t'))-2 f dt'T Q X 
Jti-! ox 3 j til 

dt' I dsT x {s)X{l! -s)+mf3 / dt'F(t') 

ti-i JO Jti-i 

d 



dX j 



W(X(t(_i))At - AtToiXiti) - X(U-i)) 



- ri((* + l/2)At)(X(ti_ fe ) - X(U-k-i)) 

k=0 

+m/3 [ dt'F{t') (36) 



We used the method of Berkowitz et al. to generate the random forces 
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Then one realization of the random force is, 



\ dtf{t) = 

JiAt k=1 \ 



J K {2irk/P) 



Cbk 



(sin(wfc(i + l)Ai) — sin(u;fciAt)) 
(cos(w/c(i + l)At) — cos(uJkiAt)) 



(37) 



where ^ a k and ^ are random numbers drawn from independent normal Gaus- 
sian distributions, ujk = 2irk/P, and P = MAt is the time interval that the 
random force doesn't repeat. The spectral density is determined by the mem- 
ory kernel through the Wiener-Khintchine theorem, 



Jr-(w) 



dtT cos(wt) 



(38) 



In all the simulations, we used At = 0.1, M = 4000, and all results are averaged 
over 40000 trajectories. We use the following fitting parameters for the mem- 
ory kernel: 70 = 28; 71 is represented by four Gaussian functions in the form 
Xi exp[~0.5((t-6 i )/a l ) 2 ], with A = (-5.2, -3, 1.56, 0.9), a = (1.5, 1.5, 1.5, 1.5), b = 
(8,10,16,18). 

We thank Yan Fu for making Fig la, and Drs. Lange and Grubmuller for 
discussions on their work. 
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Supporting information 



A Linear and nonlinear projection 

As Zwanzig pointed out, the term "nonlinear projection" is actually misleading. 
Both linear and nonlinear projections work with linear Hilbcrt space. They differ 
only by the size of the Hilbert subspace. The formal theoretical development 
discussed in the section "Summary of the Zwanzig-Mori formalism" apply for 
both cases. As an example, let's consider a 1-D Hamiltonian system. If one 
chooses the subspace as expanded by {xi,px} (the quantity A in the main 
text), the projection is linear. The projection is called "nonlinear" if the basis 
is expanded by including higher order functions of x\. Especially, Zwanzig 
presented a choice of the subspace by including all the possible function forms of 
the projected degrees of freedom. The Hilbert subspace has infinite dimension. 
For this specific example, the basis is composed of {x\,x\,--- ;pi}- In [T5] , 
we demonstrated the equivalence of the two procedures with an explicit model 
system. 



B About the term (A,A)~ l 



Both the memory kernel and the GFDR contain the term (A, A)^ 1 . Here we will 
use the above mentioned 1-D system to derive the needed elements of (A, A)^ 1 . 
Generalization to multi-dimensions is straightforward. For a Hilbert space with 
basis {xi,x^, ■ ■ ■ , x™,pi}, notice that (x\,pi) = 0. Therefore, the matrix (A, A) 



has the block form, 



( A A) 



( {x\,xi) 







[x n j X n ) 



(39) 



(pi.Pi) / 



Especially the momentum-containg term is block diagonal. In the projected 
equations, one needs only the momentum-containing term [(A, A)~ l j n ^ n+ ii 
which is (pi,pi) _1 = 1/m. 



C 2D example 

Here we use an analytically solvable model to demonstrate how the transfor- 
mation reveals some hidden relation between the memory kernel and the noise 
term, 

xi = -anxi - ai 2 x 2 + 5iCi(*) 

±2 = -anxi - a 22 x 2 (40) 
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or 

x = -a-x + g<C (41) 
C.l Direct projection 

First, let's integrate out degree 2 directly. Notice that, 

x 2 {t) = cxp(-a 22 i)cc 2 (0) + / exp(-a 2 2T)(-a 2 ix 1 (t - t) + g 2 ( 2 {t - T))dT 

Jo 

= exp(-a 22 i) (x 2 (0) + —xi(0)) - ^x x {t) 

V a 22 / 022 



/ exp(-a 22 r)(- — ±i(t - r) + ,g 2 

Jo «22 



and, 



C 2 (<-r))dT (42) 



ii = — 011X1 + / {[exp(-a 22 r)ai 2 a 2 i}a;i(t - t)g?t 
Jo 

+ {-ai2 cxp(-a 22 i)a; 2 (0) + #iCi(*) 

~y exp(-a 22 r)ai 2 .g 2 C 2 (i - r))dr| (43) 

012021 \ M 

an xi(t) 

0,22 1 



rt 

I / I 1 ! I I ■ . 1 I 

ii(< — r)dT 



/ f , ^ai2a 2 il 
/ <^ exp(-a 22 r) ) ; 

Jo I Q22 J 



-Oi2 exp(-a 22 i) ( x 2 (0) + — x x (0) ] + giCi (t) 
\ a 22 J 

- J exp(-a 22 r)ai 2 5 2 C 2 (i - r)drj (44) 

Both eqns [43] and |44] have the generalized Langeven equation form, with the 
memory and noise terms identified as those in the curled brackets. However, 
the relation between the memory kernel and the noise is not transparent in any 
of these two equations. This is a major obstacle for applying the generalized 
Langevin equation formalism in general dynamic systems. 

C.2 Projection after transformation 

Let us transform EqnHHlby multiplying M = S + T on both sides, 

= -Mi\±i - Mi 2 x 2 - 0nxi - 4>i 2 x2 + £i(t) 

= -M 2 1±1 - M22X2 - 4>12Xl - 4>22%2 + &(t) (45) 
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or by introducing the zero mass limit, 

±1 = pi/m 
X2 = Pi/m 

Pi = -MnPl - —M 12 p 2 ~ 011^1 - 012^2 + S,l(t) 

m m 

p 2 = — -M 2 ipi - — M22P2 - 021^1 - ^222:2 + (46) 



m m 



where, 



= M • a, £ = M • gC 



The matrix is symmetric, and one can obtain the analytical expression of M 
following the procedure by Kwon et al. 

C.2.1 Approach 1 

Taking the Laplace transform on both sides of Eqn. [45j 

= - Mu (s*l -Xi(0)) - Mi 2 (sX 2 - £2(0)) - 011*1 - 012*2 +£1 
= -M 2 l(sXx -Xl(0)) - M 22 (sX 2 - X 2 {Q)) - 021*1 - 022*2 + £2 

Solve x 2 from the second equation, and substitute into the first one, 
n (m M 12 M 21 \ ( 02iMi2\ . 

_ 22 M 12 \ / A/ 2 l(5*l -X!(0)) +021*1 + M 22 X 2 (0) +£ 2 \ 
M 22 ) \ SM 2 2 + 022 sAf 2 2 + 022 J 

~ M l2 ~ 

H1 - M22 

f M M 12 M 21 \ , . ( 021012 N - 



Al 



22 

L / M12 _ 012 \ SM 22 + 022 
M 2 2 022 / SM 2 2 + 022 

612 _ M12 \ / M 21 (5^1 -Xi(0)) +021X1 
i<22 M22 / \ sAf22 + 022 

_ 22 M 12 \ M22X 2 (0)+| 2 r _ Mi 2 - 
M 22 ) SM 22 + 2 2 M 22 
Mi 2 M 2 i\ _ / 021<?12 1 

Af 22 ) ^ S:El ~~ ^H )) ~~ I ^ n ) Xl 



•>22 



M 12 4>i2\ f M 2 i 012 \ s5i-2;i(0) 



M22 022/ V-^22 022/ S + (f) 22 /M[ 



J 22 



012 M i2 \ 021^1 (0) + 022 ^2(0) + I2/M22 . r Ml 2 ~ 

?1 ~ TT" 42 (40 



M 22 / s + (j> 22 /M 22 M 2 
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Therefore, 



, , <?>12<Pl2 

hi H 7 

022 



a?i(t) 



\M 2 2 022/ M 2 2 



' feT 022, / J£a _ £i ) 6(t _ T) L (48) 

M 22 \M 22 022/ J 

Comparing the above final expression with the projected equation of motion 
derived in the main text (Eqn. ), one can identify the terms in the four curled 
brackets correspond to the derivative of the potential of mean force, the memory 
term, and the random force term. 

As one important observation, while Eqns. ?? and 1451 arc different only by 
a matrix transformation, there is no simple relation between Eqns. 1441 and [48l 

C.2.2 Approach 2 

Here we will start with Eqn. 1461 The derivation is mathematically more com- 
plex, but can provide further understanding of the zero mass limit, 



u(t) = e Kt u(0) 



where, 



u = (x,p) T 



/ e KT v(t - t)<1t (49) 
Jo 



K 







1 



^22 — 7^-^22 




-~M 2 lPl ~ 012X1 +6(t) 



exp(Kt) = 2 2 , 1 o 1 o (50) 

y a_022m - a + 022 m 2 a -P+ ~ 2 a +y- J 
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M 22 + \ -k<i> 22 m 



Mf 2 



2M- 



22 



M 22 - \] -4022m + M| 2 w 2<p 22 m/M 2 



21 



Therefore, 
x 2 {t) -- 



\(a + [3 + - a_/3_)a; 2 (0) + (a + - a_)p 2 (0) 



rri 



+ [ dT(a+(r) - a_(r)) 
Jo 

-<tH2Xl(t - T) + &(t - T)] 

022 + i / 022 +\ 



M 2lPl (t-T) 



/ $22 . 

dr ( e M 22 



1 



M 22 



— M 2lPl (t-T) 
m 



-4>i 2 xi(t - t) + &(t - t)} 

022 i ' / 022 +\ 

= e ~T^x 2 {0) + TT - e"^ f p 2 (0) 

Moo \ / 



+ 



e 
M 22 

022 



m 22 t m 



e m 



p 22 
m 



Moo / M- 



1 



1 



M 22 J M- 



i 22 



[-4>12X\{t)\ 



$22 . 

dre M 22 



dre' 



6(t-' 



[-1 


(M 21 


_ 012 \ 


m 


\M 22 


022 / 




(M 21 


012™ 


m 


\M 22 


M| 2 



-0i 2 zi(O)] 



pi(i-r) 



M 22 



022 f ! 022 + 



e M 22 £ a; 2 (0) + 



M 22 



: P2(0) 



<?12 
022 



dre M 22 T 



e M 22 
1 /M 2 i 



■^*i(0) 

022 



P)Pl(t-T) 



m \M 22 <^>22 



JO 



m M 22 M 22 



(52) 



22 



p 2 (t) = {a-(j)2im - a + (t) 2 2m)x 2 (0) + - ^a + f3-)p 2 (0) 



dr 



-—M 2 lPl(t-T) 

m 



-4>12Xl(t - T) + &(t ~ T)] 
rt 



ip-t m(p22 



M- 



-X 2 



22 



(0)+ / 
Jo 



dr e 



T — g M 22 



$P- T 922m 
M| 2 



-M 2 ipi(t - r) - 0i 2 a;i(i - r) + & (* - r) 



„^22_ f m©22 ,„n ™ / M 22 j 

_ e M 22 t _2j££ X2 (o) / . — ■ 



I 



+ / rfre 



M 22 

«22 - 



A/. 



22 



(e~ — *- e ~*S*J [-^12x1(0)] 



-M 2 ipi(t-r)+&(*-T) 



»22 - 

- / rfre M 22 



^22 



M 22 \M; 



M 2 i 



'12 



22 \-<W22 </»22 



pi(i-r) 



, <^22TO 



(53) 
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Then, 
Pi(t) 



M llPl (t) - (j> llXl {t) + ^(t) 

to 

M12 f -p2- t m(f>22 rrt . 
— e M 22 — - — X2{0) 



rn 
/■i 



m 



M22 



M- 



22 



(e-^-fT^ 4 ) [-0i 2 Xi(O)] 



f dre-^ T \--M 2lPl (t - t) + 6 (t - t) 
Jo I m 



f , _^22_ 

- / dre M 22 
^0 



m 22 vm 22 22 y v y 



[ ^22 -i , v 1 __*22_ + , . ^22 j 

-012 < e "22 z 2 (0) + — — e M 22 p 2 (0) - e M 22 



?>12 
022 



M 22 

dre~ M 22 T 



-— «i(0) 



.'22 



1 /M; 



TO \M 22 022 



099 / 



_ M22 

are m 



1 M 21 1 

- T) + 



Pll 



m M22 

'12012 



M- 



22 



■&(t- 



;»22 



Xl{t) 



M nPl (t) 

TO 



21 



M 22 



022 /"* . -*jOL T ( Ml2 012 
m J \M 22 022 

/■* M12M21 _M22 T \ 

+ / dr ^ — e m Pi(*- T )f 

Jo 171 ) 



W-r) 

©22/ 



M- 



WO) + &(*)-— fdre-^Ut-r) 
JW22 m y 

y m 2 2\m 2 2 022 y ^ 'J 



'19 _2*22_f 
■ e M 22 

22 



Noticing, 



then one obtaines 



/* _ M 2 2 . 

lim / (ire "» ' 
m-K) J 



Ti(t-r) « ft(t) 



TO 



M; 



22 



(54) 



(55) 
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C.2.3 Proof of the generalized F-D relation 



To confirm the generalized nuctuation-dissipatin relation, let's consider that 
t' < t. We have, 



Then, 



< PiXi > 

<pI > 

< Xiij{t) > 
<&(W) 



< (folXl + <j)22X2) 2 > 





m 


(Mij + Mji)5(t - t') 



'22 + |p 



e 2- 



-±y T -V0V</>-y 



= <P22 



Mi 



(56) 



s: 



$22 



2M12 



»12 



M| 2 M22 



Ut-r, 



•)(r- 



022 M i2 



Mf 2 



t' 



2 / dr' e -^^' +t - t ') 
./o 



622 M12 
M? M 2 



Mo 



= -2 



j Mi 



"i 2 M|. 



M 2 2 2 



M 



22 



20 



22 



e m 2 2 (* +*) _ e M22 (* * ) 



"M 12 


012 


2 


M22 


022 _ 


022 



g M22 (* +t ) — g M22 (* * ) 



Mi 

m" 



_ $22 
— g M 22 



(t-f) 



-&(*') 
22 

'22 Mi; 

M| 2 



__022_. 

dre M 22 



622 M12 
M 2 



'12 



M 



22 



M 22 
1 



Ut - r) > 



(M21 - M12) 1 - -5(t - i') 



^ (i_t 'U 2 2 I 



-g M22 



M 



12 



til] ( 



' M 



12 



M 



\M 2 2 022/\M 2 2 M 22 / 



^ fl-i 



«5(t - t 7 ) J (58) 
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M12 (j)\2 



. -fa 



dre M 22 T 



12 



hi- 



22 <^22 



M12 _ M2l\ 1 

M 22 M 22 



M| 2 



|5(t-f) 



22 



6(*'-T) > 



(59) 



e -^(*+*') f ^ gig 



M- 

XF^ " P ) < (^21^1 (0) + ^22^(0) + <to(0)) 2 > 

A/22 022/ 



*23- (t+t') / M 12 4>12 



AL 



22 f22 



^22 



(60) 



Bu summing the above expressions, one has the generalized fluctuation- 
dissipation relation, 

< F^F^t') < V \ = K u (61) 

One can not exchange the order of the limits m — > and t — >• 0. That is, 

1 m 



(62) 



(63) 



lim / dre-^«y(t-f-r) = ^J(t-f) 

D Projection with a quasi-linear system 

Let's consider an (N + l)-dimensional quasi-linear system, 

= -^(z )-M a;o-*ox-x-Mo x -x + £o(i) 
= -* x - M x - * x0 x - M x0 i + 

Then, 

x(t) = e- M "-**x(0)+ fe""" 1 * 

Jo 

(-M- 1 • * x0 x (t - r) - M- 1 • M x0 i (t - r) + M" 1 • £(t - r)) dr 
= e- M_1 -**x(0) + e - M_1 -** . $ 1 . $ x02;o (0) 

+ / dre-^^M- 1 • £(i - r) - IT 1 • * x0 x (t) 
Jo 

- / dTe- M "* T • * x0 + M- 1 • M x0 ) x (t - t) (64) 

Jo 
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x(i) = -1VT 1 • * • 



and, 
= 



. e - M_1 -* t x(0)+e- M_1 -* t -*- 1 -* xO x o (0) 

+ f dre-^' ^M- 1 ■ e(i - r) - * 1 • * x0 x (t) 
Jo 

- / dre- M "-* T • * x0 + M 1 • M x0 ) x (t - r) 

Jo 



-M- 1 • * x0 x - M- 1 • M x0 i + M- 1 • 
= -M- 1 • * • e- M_1 -**x(0) - e - M ~ Km 



M 



-l 



- / dre- M 1 * T • M- 1 • * • M- 1 • £(i - r) + M~ 1 • f (t) 
Jo 

- / dre- M "-* r • M- 1 • (* x0 + * • M- 1 • M x0 ) x (t - r) 
Jo 

iv/r-1 i\/r 



$ x0 X (0) 

dTe _M_1 -* T • M~ 1 • * • NT 1 • f (i - t) + M~ 1 • 





(65) 



-<£(ar ) - M x - *Ox ■ x - M 0x • x + £ Q (t) 

- {(j>(x ) - *ox • * 1 • *xoa;o(t)} 

- / drx a (t - t) {2 [M - M 0x • M- 1 • M x0 ] 6{t) 
Jo 

+ [M 0x • M 1 • * - *ox] • e - M " 1#r • M- 1 • (* x0 + * • M- 1 • 
+ {^o(t)-M 0x -M- 1 ^(i) 

+ [M 0x M 1 * * x] • / dre- M " 1 * r M- 1 • £(i - r) 

Jo 

+ [M 0x M 1 * * 0x ] • e- M "-**x(0) 
+ [M 0x • M 1 • 



M x0 )} 



<|. <h :x .< XI 1 •<!» • 



'xo^o 



.(0)} 



(66) 

The terms in the curled brackets are the derivative of potential of mean force, 
the memory kernel, and the random force terms, respectively. 
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